Skip to content

Conversation

@devin-petersohn
Copy link
Contributor

What changes were proposed in this pull request?

Improve the performance of various metadata and precomputing operations in pandas by using list comprehension.

Why are the changes needed?

Performance and maintainability

Does this PR introduce any user-facing change?

No

How was this patch tested?

CI

Was this patch authored or co-authored using generative AI tooling?

No

…ension

Signed-off-by: Devin Petersohn <devin.petersohn@gmail.com>
Co-authored-by: Devin Petersohn <devin.petersohn@snowflake.com>
@github-actions
Copy link

github-actions bot commented Jan 6, 2026

JIRA Issue Information

=== Bug SPARK-55025 ===
Summary: Pyspark pandas use of nested for loops
Assignee: None
Status: Open
Affected: ["4.1.1"]


This comment was automatically generated by GitHub Actions

Signed-off-by: Devin Petersohn <devin.petersohn@gmail.com>
Signed-off-by: Devin Petersohn <devin.petersohn@gmail.com>
@holdenk
Copy link
Contributor

holdenk commented Jan 12, 2026

@devin-petersohn we should make a new JIRA for this since the old one is already resolved.

@devin-petersohn devin-petersohn changed the title [SPARK-54787][PS] Improve performance in pandas by using list comprehension [SPARK-55025][PS] Improve performance in pandas by using list comprehension Jan 13, 2026
@devin-petersohn
Copy link
Contributor Author

Sorry about that! Fixed the title and created a new issue.

Copy link
Member

@HyukjinKwon HyukjinKwon left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I guess this is good to go? wdyt @gaogaotiantian @holdenk

Copy link
Contributor

@huaxingao huaxingao left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@huaxingao
Copy link
Contributor

Merged to master! Thanks @devin-petersohn for the PR! Thanks @holdenk @HyukjinKwon @gaogaotiantian for the review!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants